Automatic Assessment of the Speech of Young English Learners
نویسندگان
چکیده
This paper introduces some of the research behind automatic scoring of the speaking part of the Arizona English Language Learner Assessment, a large-scale test now operational for students in Arizona. Approximately 70% of the students tested are in the range 4-11 years old. We cover the methods used to assess spoken responses automatically, considering both what the student says and the way in which the student speaks. We also provide evidence for the validity of machine scores. The assessments include 10 open-ended item types. For 9 of the 10 open item types, machine scoring performed at a similar level or better than human scoring at the item-type level. At the participant level, correlation coefficients between machine overall scores and average human overall scores were: Kindergarten: 0.88; Grades 1-2: 0.90; Grades 3-5: 0.94; Grades 6-8: 0.95; Grades 9-12: 0.93. The average correlation coefficient was 0.92. We include a note on implementing a detector to catch problematic test performances.
منابع مشابه
Pragmatic Criteria in the Holistic and Analytic Rating of the Disagreement Speech Act of Iranian EFL Learners by Non-native English Speaking Teachers
onveying a strong message within a language stems from not only a linguistically appropriate utterance but also a pragmatically appropriate discourse. Broadly considering various facets of pragmatics, pragmatic assessment has not been potentially brought into perspective. To address this discourse gap, this study, guided by the principles of mixed-method design, pursued three purposes: ...
متن کاملEnglish and Non English major Teachers’ Assessment of Oral Proficiency: a case of Iranian Maritime English Learners
Speaking assessment is still construed as a complicated, under-researched process from the vantage point of tasks and rater characteristics. The present study aimed at investigating if and how English Major and none English Major teachers differ in their perception of the construct of oral proficiency while assessing learners’ L2 oral proficiency. To this end, 38 male and female non-native EFL...
متن کاملThe Washback Effect of Task-based Assessment on the Iranian EFL Learners' Development of Pragmatic Competence
The present study was an attempt to explore the ‘washback effect’ of task-based assessment (TBLA) on EFL Iranian learners’ pragmatic development. To this end, through conducting KET (Key English Test), 60 out of 120 EFL Iranian learners studying in an English language school, were randomly selected. They were assigned to treatment group (N=30), and control group (N=30). The treatment group was ...
متن کاملRole of Monolingualism/Bilingualism on Pragmatic Awareness and Production of Apology Speech Act of English as a Second and Third Language
The present study investigated the pragmatic awareness and production of Iranian Turkish and Persian EFL learners in the speech act of apology. Sixty-eight learners of English studying at several universities in Iran were selected based on simple random sampling as the monolingual and bilingual participants. Data were elicited by means of a written discourse self-assessment/completion test (WDS...
متن کاملNative and Non-native English Teachers’ Rating Criteria and Variation in the Assessment of L2 Pragmatic Production: The Speech Act of Compliment
Pragmatic assessment and consistency in rating are among the subject matters which are still in need of more profound investigations. The importance of the issue is highlighted when remembering that inconsistency in ratings would surely damage the test fairness issue in assessment and lead to much diversity in ratings. Our principal concern in this study was observing the criteria that American...
متن کاملThe Role of L2 Private Speech in Cognitive Regulation of Adult Foreign Language Learners
The present study investigated the use of L2 private speech by English foreign language (EFL) learners in regulating their mental activities. Thirty intermediate adult EFL learners took a test of solving challenging English riddles while their voices were being recorded. Following, instances of the produced private speech were analyzed in terms of form, content, and function. Numerous instances...
متن کامل